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ABSTRACT: The increasing importance of meat quality has implications for animal breeding programs. Research has revealed much 
about the genetic background of pigs, and many studies have revealed the importance of various genetic factors. Since meat quality is a 
complex trait which is affected by many factors, consideration of the overall phenotype is very useful to study meat quality. For 
integrating the phenotypes, we used principle component analysis (PCA). The significant SNPs refer to results of the GRAMMAR 
method against PCI, PC2 and PC3 of 14 meat quality traits of 181 Duroc pigs. The Genome-wide association study (GWAS) found 26 
potential SNPs affecting various meat quality traits. The loci identified are located in or near 23 genes. The SNPs associated with meat 
quality are in or near five genes (ANKl, BMP6, SHH, PIP4K2A, and F0XN2) and have been reported previously. Twenty-five of the 
significant SNPs also located in meat quality-related QTL regions, these result supported the QTL effect indirectly. Each single gene 
typically affects multiple traits. Therefore, it is a useful approach to use integrated traits for the various traits at the same time. This 
innovative approach using integrated traits could be applied on other GWAS of complex-traits including meat-quality, and the results 
will contribute to improving meat-quality of pork. (Key Words: Genome-wide Association Study, Principle Component Analysis, Meat 
Quality, Duroc Pig) 



INTRODUCTION 

As consumers seek tastier, healthier, safer and more 
nutritious pork, the quality of the meat has been improved. 
As pork quality is influenced by many genetic and non- 
genetic factors, many studies have been focused on 
reveahng the genetic background and its various genetic 
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factors (Rosenvold and Andersen 2003; Hu et al., 2013). 
These studies have shown that selective breeding of pigs 
and using DNA markers could play important roles in 
enhancing pork quality. Meat quality traits are also affected 
by many genes that are mapped in the quantitative trait loci 
(QTL) regions. Therefore, a great deal of research related to 
farm animals has focused on mapping and characterization 
of trait loci, and genome research (Andersson 2001; 
Andersson and Georges, 2004). Genetic variants as a part of 
genetic factors are also used to identify contribution to 
complex traits. Even within breeds, there is considerable 
genetic variation in relation to meat quality traits such as pH 
and intramuscular fat. Part of this variation is due to genetic 
differences. It therefore follows that DNA marker 
technology is important in improving meat quality (De 
Vries et al., 2000). 

Genome-wide association study (GWAS) uses genetic 
variants with interesting traits. Since meat quality is a 
complex trait affected by many factors, consideration of the 
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several traits is vital. Although there have been many 
GWAS for pork quality traits, there has been few studies of 
multiple traits of meat quality (Luo et al., 2012; Ma et al., 
2013). The few studies of multiple traits, used several traits, 
but each trait was analyzed separately then the results were 
considered together. Most research has been performed on 
single traits, where we performed a multiple trait analysis to 
find candidate genes of good meat quality by integrating the 
phenotypes. The overall represented phenotype related to 
meat quality obtained by principal component analysis 
(PC A) in this study is defined as the integrated phenotype. 
The integrated phenotypes are representative of overall 
meat-quality phenotypes and are non-redundant information. 
However, some unique information obtained from several 
phenotypes may be important, since the phenotypes are 
correlated and partially redundant. Therefore a study of the 
common genetic associations among these phenotypes may 
provide a more complete understanding of the mechanisms 
involved in the production of good-quality meat. PCA can 
extract non-redundant information among the correlated 
traits with the goal being to extract as much variance with 
the fewest components. PCA makes new variables, 
principal components (PCs), to describe the data in a more 
concise or convenient way. PCA explains most of the 
variance of the data by extracting PCs from a specific Unear 
combination of the original phenotypes and reducing the 
dimensionality of a data set which are correlated. An earlier 
study using PCA to integrate phenotypes successfully found 
the variants associated with osteoporosis (Karasik et al., 
2012). 

To detect candidate genetic variants associated with 
various meat quality traits, we considered meat quality in 
relation to all phenotypes together, and reduced the 
dimensions of these phenotypes by PCA in the the Duroc 
breed, which has been identified as a superior genetic 
source for improving the eating qualities of pork according 
to the recent National Pork Producers' Council (NPPC) 
Terminal Sire-Line Genetic Evaluation Program (Goodwin 
and Burroughs, 1995). 

MATERIALS AND METHODS 

Samples and phenotypes 

In this study, we used a two-generation resoiu'ce 
population of Duroc pigs. Duroc boars were mated with 106 
Duroc sows. The 181 female Duroc pigs from four farms 
were humanely killed at 184±16.3 (mean+SD) days in 10 
batches (slaughter groups). The genomic DNA samples 
were genotyped using the lUumina Porcine 60K SNP 
BeadChip (Illumina Inc., USA). After slaughtering, meat 
quality traits of the longissimus muscle were evaluated. All 
muscle samples were taken from the same position of 
longissimus dorsi, between 10th and 11th rib, from one 



slaughter batch. The information of meat quality 
measurement is described in Supplementary Table SI, S2. 
We measured 14 meat quality traits, including meat pH 24 h 
post-mortem (pH[24]), meat color (CIE-L*, CIE-a*, CIE- 
b*), shear force (SF), drip loss, heat loss, fat (intramuscular 
fat content), water-holding capacity (WHC), back-fat 
thickness (BE), protein, cholesterol, moisture and ash 
contents. PCA was conducted for these 14 phenotypes using 
R, and correlations between the phenotypes were calculated 
with the corrplot R package (Friendly, 2002). 

GRAMMAR: step 1 

We performed PCA analysis and got three integrated 
phenotypes (PCs) related to meat quality, which were then 
used to estimate meat quality. Residual in Genome-wide 
rapid association using mixed model and regression 
(GRAMMAR) (Aulchenko et al., 2007) was used as a 
phenotype that reflected the genetic factors. 

Y = Wv-HZa-He, 

where Y = the vector of each observations (3 PCs), v = 
the vector of fixed effect as age on the slaughter day, a = the 
vector of the random effect as relationship matrix and e = 
the vector of residual error. W and Z are design matrices for 
V and a, respectively. 

Observations comprised a total of three PCs of 181 pigs. 
As all 181 pigs were female, we only used age as a 
covariate. The relationship matrix Z, based on pedigree 
information, included the parents of the 181 pigs. Residual 
(e) contains the environmental effect along with a major 
part of the genetic effect. All parameters were estimated 
using the ASREML program (Gilmour et al., 2009) 
faciUtated by the AIREML algorithm for a single-trait 
animal model. Using this model, we calculated the residual 
for the phenotype related to meat quaUty used in this 
GWAS. 

GRAMMAR: step 2 

The Illumina Porcine 60K SNP BeadChip contains 
62,163 SNPs that are distributed across the 18 pig 
autosomes and the X chromosome. We excluded all SNPs 
with a miss rate of >5%, a minor allele frequency (MAF) of 
<0.05, and a Hardy- Weinberg equilibrium (HWE) test 
p-value of <10~''. After quality control, we retained 45,620 
SNPs for further analysis. Association analysis of the SNPs 
with a residual for PCI, PC2, and PC3 were conducted on 
the basis of a linear regression using PLINK 1.07 (Purcell et 
al., 2007). The SNPs were mapped on the Sus scrofa 
genome 10.2 assembly, version 2 from NCBI FTP, using 
BEAT (Kent, 2002). 

Genome-wide significance was defined based on the 
LD-adjusted Bonferroni method (Duggal et al., 2008) and 
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Figure 1. Pairwise phenotypic correlations between traits. A) Shiowing 14 meat traits: Ash, moisture, meat pH24, fat (intramuscular fat 
content), meat colour (CIE-L*, CIE-a*, CIE-b*), WHC (water holding capacity), SF (shear force), back fat thickness (BE), protein, 
cholesterol, drip loss, heat loss. B) Here the three PCs were added to the results of the PCA, and the 14 phenotypes along with PCI, PC2, 
and PC3 are shown. The phenotypic correlations between traits in similar phenotypes are stronger than others. The congelation values are 
represented as colours, where range of colours (from red to blue) shows the range of congelation values. The blue colour represents 
positive conelation between the phenotypes, the red colour means negative conelation between the phenotypes. The order was 
determined by hierarchical clustering method. 



simple BonfeiToni method to correct p-value thresholds of 
significance: significant association of 0.05 false positives 
per GWAS was used as a genome- wide significance. Duroc 
pigs are estimated to have 6,948 "independent" tests based 
on the Gabriel-type block (Gabriel et al., 2002), as 
calculated by Haploview (Barrett et al., 2005). Therefore, 
7.2E-06 (0.05/6,948) has genome-wide significance. An 
overview of the p-values of tests using Manhattan plots was 
produced by R. Significant SNPs were mapped on to the 
meat-quality QTL regions of the pig genome, assembly 
version 10.2 (Hu et al., 2013). 

RESULTS 

PCA of meat-quality related phenotypes 

Summary of phenotypes measurement are described in 
Supplementary Table SI, S2. There was a correlation 
between phenotypes (Figure 1 a), which shows the results of 
a PCA of the meat quality-related phenotypes. The three 
PCA components jointly comprised 52.3% of the total 
variability of meat quality-related phenotypes 
(Supplementary Figure SI). PCI, which accounted for 
21.9% of the total variance, includes: pH, moisture, fat 
content, and meat color components. PCI weights were 
very similar in these factors (Figure lb). PC2, which 
accounted for 15.5% of the variance, includes SF, BF and 
protein components (Figure lb). PC3, which accounted for 



14.8% of the total variance, includes: moisture, drip loss, 
and heat loss components (Figure lb). To fully consider 
pork quality, we used PCI, PC2, and PC3, because these 
components explained a large proportion of the total 
variability in meat quality-related traits. 

SNP-association studies for integrated phenotypes 

Using the GRAMMAR method (Aulchenko et la., 2007), 
we identified the top 26 and 11 SNPs associated with 
integrated meat quality in the Duroc breed at the 
significance level of 7.2E-06, which equals an LD-adjusted 
BonfeiToni p-value of 0.05 and a simple Bonferroni 
significance level of 1.096E-06 (Table 1). The inflation 
factor X for PCI, PC2 and PC3 of pig were 1.646, 1.691, 
and 1.173, respectively (Supplementary Figure S2). These 
were corrected results for structure/familial effects by using 
the G matrix. The number of significant SNPs in the genie 
regions was roughly equal to those in the inter-genic 
regions. Fourteen significant SNPs were associated with 
PCI (six in genie regions, eight in intergenic regions), 
eleven SNPs were associated with PC2 (six in genie regions, 
five in intergenic regions), and two SNPs were associated 
with PC3 (one in a genie region, one in an intergenic 
region). The coverage of genie and intergenic regions were 
roughly 1:3 (25.8 to 74.2%; 637 to 1,838Mb). When 
considering the coverage, the number of significant SNPs in 
genie regions was significantly greater than that in the 
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Table 1. Top SNPs associated with PCs in the Duroc pig breed 



PC CHR SNP Position , . MAP" BETA*' p-value Nearby gene 

/major 



PCI 2 


ASGA0010593 


85596454 


G/A 


0.425 


0.013 


5.56E-06 


SNORA70 


7365 


2 


MARC0001645 


87106234 


A/G 


0.384 


0.015 


1.35E-06 


ENSSSCGOOOOOO 

14087 


Genie 


2 


UMB 10000069 


87305603 


G/A 


0.365 


0.015 


7.70E-07 * 


ENSSSCGOOOOOO 
25645 


Genie 


2 


M1GA0002981 


87507128 


A/G 


0.392 


0.014 


4.58E-06 


AGGFl 


Genie 


2 


MARC0023062 


87571413 


G/A 


0.394 


0.014 


6.26E-06 


SNORA47 


-2363 


2 


ASGA0010696 


87821516 


G/A 


0.403 


0.014 


4.45E-06 


PDE8B 


Genie 


17 


INRA0052583 


12436823 


A/G 


0.428 


0.013 


5.48E-06 


ANKl 


-17485 


17 


ASGA0075294 


12781032 


A/G 


0.483 


-0.013 


6.99E-07* 


C8ORF40 


31476 


17 


ASGA0075297 


12798139 


G/A 


0.431 


-0.016 


6.92E-08* 


C8ORF40 


14369 


17 


DIAS0003526 


12823893 


G/A 


0.431 


-0.016 


6.92E-08* 


C8ORF40 


Genie 


17 


ALGA0093121 


12877758 


A/G 


0.392 


-0.016 


5.24E-08* 


C8ORF40 


-53786 


17 


ASGA0075278 


12896953 


C/A 


0.326 


-0.016 


2.04E-07* 


ENSSSCGOOOOOO 
07025 


45324 


17 


ASGA0075277 


13015930 


G/A 


0.392 


-0.016 


5.24E-08* 


SLC20A2 


Genie 


PC2 7 


ASGA0030822 


5278629 


C/A 


0.328 


-0.015 


1.09E-06* 


BMP6 


Genie 


8 


ASGA0093827 


76550202 


A/G 


0.086 


-0.021 


1.09E-06* 


SHROOM3 


Genie 


9 


ASGA0095397 


17376220 


A/G 


0.108 


-0.021 


1.20E-06 


ENSSSCGOOOOOO 
14896 


-88,4312 


10 


MARC0020616 


57676836 


A/G 


0.050 


-0.029 


1.32E-06 


ARMC3 


Genie 


10 


ASGA0048292 


57954471 


G/A 


0.050 


-0.029 


1.32E-06 


PIP4K2A 


Genie 


10 


ASGA0048295 


57985436 


A/G 


0.047 


-0.030 


1.12E-06 


PIP4K2A 


Genie 


18 


MARC0096007 


2334006 


A/G 


0.182 


-0.015 


2.30E-06 


SHH 


330788 


18 


MARC0019932 


7383149 


A/G 


0.146 


-0.017 


6.93E-06 


ENSSSCGOOOOOO 
30537 


3947 


18 


MARC0078684 


7813561 


A/G 


0.138 


-0.020 


2.53E-07* 


TRBV21 


Genie 


18 


H3GA0050297 


7848619 


G/A 


0.138 


-0.020 


2.53E-07* 


TRBV27 


-7443 


X 


MARC0070937 


44930506 


G/A 


0.146 


-0.017 


2.86E-06 


ENSSSCGOOOOOO 
26984 


-125815 


PC3 3 


ASGA0015307 


98152224 


A/G 


0.150 


-0.016 


5.57E-06 


F0XN2 


-5059 


16 


DRGAOO 16097 


36203829 


A/G 


0.072 


0.025 


5.45E-06 


ENSSSCGOOOOOO 
16898 


Genie 



" MAF = Minor allele frequency. BETA = Linear regression coefficient. 

' Type = 'Genie' means the SNP is located in genie region, value indicate direction and distance (bp) from the SNP to near genie region. 
* <Simple Bonfferoni significance level (1.096E-06). 



intergenic regions. This may suggest that intergenic regions, 
although containing important regulatory sequences, have 
the smallest ratio of functional to total DNA. 

Whether the SNPs significantly associated with 
integrated meat-quality phenotypes were located in meat 
quality-related QTL regions was determined (Figure 2 and 
Table 1). Twenty-five of the 26 SNPs were located in meat 
quality-related QTL regions, such as back fat thickness, 
intramuscular fat, protein, shear force, meat color (L*, a*, 
b*), pH, heat loss, and drip loss (Supplementary Table S3) 
(Hu et al., 2013). The exceptions were MARC0070937 
(ENSSSCG00000026984, SSCX: 44930506) and 
MARC0096007 (near SHH, SSC18: 2334006). Many of the 
significant SNPs were in high Unkage disequilibrium (LD) 



with each other, suggesting that they are inherited as a 
group (Figure 3, Supplementary Figure S3- 10). 

DISCUSSION 

By applying PCA to the correlation-coefficient matrix 
of related phenotypes, we were able to extract essential and 
non-redundant information. Multiple trait analysis is a more 
useful method of locating candidate variants than multiple 
single trait analysis in the context of systems genetics, since 
a summary of several correlated traits provides a 
comprehensive view of the genetics of meat quality. Thus, 
we found the locus which influences multiple correlated 
traits by applying PCA on multiple traits. We approximated 
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Figure 2. Manhattan plots of GWAS results for the traits and QTL mapping. GWAS for the integrated phenotypes using illumina Porcine 
60K SNP BeadChip of 181 Duroc samples. Each panel A), B) and C) show GWAS results against residual of PCI, PC2, and PC3. The x- 
axis of the Manhattan plot shows the genomic position, the y-axis represents the loglO base transformed p-values, LD-adjusted 
Bonferroni significance levels (7.2E-06) was applied. Meat-quality-related QTL regions located in significant SNPs are shown. 




Figure 3. Example result of linkage disequilibrium around the significant SNPs of eigenvector 1 PCA results on chromosome 2. Linkage 
disequilibrium (LD) was determined using Haploview, and LD blocks were defined using the Gabriel rule. Darker boxes represent higher 
LD percentages. Pairwise D' values ('100) are indicated; blue blocks indicate D' = 0. 
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the effective number of independent SNPs across the 
genome, and applied a Bonferroni LD-adjusted significance 
(7.2E-06). We corrected our figure to produce an accurate 
number of tests to determine the regions of the genome that 
should be studied further. The inflation factors seemed to be 
high, but we corrected for the population stratification in 
this analysis using the GRAMMAR approach. Yang et al. 
2011 showed that the magnitude of genomic inflation 
factors depends on sample size, heritability, linkage 
disequilibrium structure and the number of causal variants 
under polygenic inheritance even when there is no 
population structure. 

The 26 SNPs associated with integrated meat-quality 
phenotypes and the loci identified are located in or near 23 
genes (Table 1). Among them, ANKl, BMP6, SHH, 
PIP4K2A, and F0XN2 are known to be strong candidates 
for meat quality. It has been suggested that SHH and BMP6 
contribute to both juiciness and tenderness of the meat 
(Fonseca et al., 2003). MARC0096007 (SSC18: 2334006) 
and ASGA0030822, (SSC7: 5278629) are located near SHH 
gene and BMP6 gene, respectively. The SHH and BMP6 are 
members of the hedgehog signaUng pathway. The hedgehog 
family of proteins and, SHH in particular, may be important 
in slow oxidative (SO) fiber clustering in pigs. Recent 
evidence suggests that fiber-type proportions play a role in 
the quaUty of meat, and in pigs it has been suggested that 
the SO fibers contribute to juiciness and tenderness 
(Fonseca et al., 2003). 

F0XN2, ANKl, and PIP4K2A are related to meat 
quahty 1NRA0052583 (SSC17: 12436823) and 
ASGA0015307 (SSC3:98 152224) are located near ANKl 
gene and F0XN2 gene, respectively. ANKl gene was first 
discovered in erythrocytes, but has since also been found in 
brain and muscle tissues. ANKl is known to be significantly 
associated with many meat-quality-related traits (water- 
binding capacity, drip loss, heat loss, pH, IMF, and shear 
force) (Wimmers et al., 2007). ANKl and F0XN2 are 
related to type 2 diabetes (Wolfrum et al., 2004; Tabassum 
et al., 2008; Imamura et al., 2012). SNPs (ASGA0048292 
and ASGA0048295, SSCIO: 57954471, 57985436) are 
located within an intron of PIP4K2A, which is an important 
gene for determining whole-body insulin responsiveness, 
and mutation of the PIP4K2A gene could increase insulin 
sensitivity of skeletal muscle without any significant change 
in adipose tissue (Xu et al., 2012). This leads indirectly to a 
decrease in transport of glucose into white adipose tissues 
and reduced adiposity, because the increased insulin- 
stimulated glucose transport in the muscle leads to a lower 
basal insuUn-secretion requirement. PDE8B, located near 
SNP of ASGAOO 10696, is a major regulator of an adrenal 
steroidogenesis (Tsai et al., 2011) and modulates basal 
corticosterone synthesis (Tsai and Beavo, 2011). 
Corticosterone can affect meat quality by changing pre-and 



post-slaughter muscle metabolism (Gao et al., 2008). 

One Umitation of this study was the small sample size, 
which resulted in low power, and we found a small number 
of significant SNPs. Despite the limitations, this study does 
have its strengths. Even though this data has low power, 
significant SNPs were detected and they are related with 
meat quahty. Meat quality is a complex trait that is affected 
by many factors. Therefore, consideration of the overall 
phenotype is a useful approach to evaluate meat quaUty. 
This study also used many phenotypes, and extracted non- 
redundant and integrated information from correlated 
phenotypes. Other studies have analyzed a single phenotype, 
or even if several phenotypes were considered, they were 
analyzed separately then considered together. Unhke these 
other studies (Luo et al., 2012; Ma et al., 2013), we found 
some SNPs possibly associated with all phenotypes when 
analyzed together using PCs. The significant SNPs in or 
near genes seemed to be related to several meat-quality 
phenotypes. 

The 21 genes among our findings are also located in 
meat quality QTL regions. Most of the significant SNPs are 
located in LD block. The SNPs tag the block regions which 
are related with phenotypes and locate within the previously 
reported QTL regions, and thus the SNPs may contribute 
these complex traits. A QTL region is generally large, 
therefore fine mapping is needed. In this context, our 
findings narrow down the meat quality QTL regions. There 
are limitations that the traits are highly correlated so that 
they were possibly influenced by the same gene. Also, the 
LD blocks are large containing several genes, so we cannot 
exclude closely linked genes that affect the different traits. 
However, this innovative approach using integrated 
phenotypes can be applied on other GWAS of complex- 
traits including meat-quality. 

Traits determining meat quality are difficult to improve 
by traditional selection and breeding methods, because 
heritability is quite low (Davoli and Braglia, 2007), the 
methods are expensive and some are only possible after the 
animal has been slaughtered. However, knowledge of the 
number of genes involved in the qualitative characteristics 
of meat would provide new information that could be used 
in breeding strategies intended to improve meat quahty. 

Meat-quality is determined by various factors, thus, 
consideration of the various phenotypes with 
simultaneously restricted QTL regions is a useful way to 
evaluate meat quality. This may lead to 
an innovative approach of the GWAS of complex-traits 
including meat-quality. If the significant SNPs of this study 
and additional SNPs from other studies using the same 
method as this study are used in marker-assisted selection, it 
will contribute to improving the meat and carcass quahty in 
pig in a more efficient method than the traditional pig 
selection. 
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